A Nonparametric Valley-Seeking Technique for Cluster Analysis

نویسندگان

  • Warren L. G. Koontz
  • Keinosuke Fukunaga
چکیده

The problem of clustering multivariate observations is viewed as the replacement of a set of vectors with a set of labels and representative vectors. A general c r i ter ion for clustering is derived as a measure of representation error. Some special cases are derived by simplifying the general c r i te r ion . A general algorithm for f inding the optimum classi f icat ion with respect to a given cr i ter ion is derived. For a part icular case, the algorithm reduces to a repeated applicat ion of a straightforward decision rule which behaves as a valley-seeking technique. Asymptotic properties of the procedure are developed. Numerical examples are presented for the f i n i t e sample case.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Robust nonparametric descriptors for clustering quantification in single-molecule localization microscopy

We report a robust nonparametric descriptor, J′(r), for quantifying the spatial organization of molecules in singlemolecule localization microscopy. J′(r), based on nearest neighbor distribution functions, does not require any parameter as an input for analyzing point patterns. We show that J′(r) displays a valley shape in the presence of clusters of molecules, and the characteristics of the va...

متن کامل

Distribution Free Decomposition of Multivariate Data SPR'98 Invited submission

We present a practical approach to nonparametric cluster analysis of large data sets. The number of clusters and the cluster centers are automatically derived by mode seeking with the mean shift procedure on a reduced set of points randomly selected from the data. The cluster boundaries are delineated using a k-nearest neighbor technique. The proposed algorithm is stable and e cient, a 10000 po...

متن کامل

Extracting Prior Knowledge from Data Distribution to Migrate from Blind to Semi-Supervised Clustering

Although many studies have been conducted to improve the clustering efficiency, most of the state-of-art schemes suffer from the lack of robustness and stability. This paper is aimed at proposing an efficient approach to elicit prior knowledge in terms of must-link and cannot-link from the estimated distribution of raw data in order to convert a blind clustering problem into a semi-supervised o...

متن کامل

Neighbor number, valley seeking and clustering

This paper proposes a novel nonparametric clustering algorithm capable of identifying shape-free clusters. This algorithm is based on a nonparametric estimation of the normalized density derivative (NDD) and the local convexity of the density distribution function, both of which are represented in a very concise form in terms of neighbor numbers. We use NDD to measure the dissimilarity between ...

متن کامل

Evaluation of seasonal variability in surface water quality of Shallow Valley Lake, Kashmir, India, using multivariate statistical techniques

Seasonal variation in water quality of Anchar Lake was evaluated using multivariate statistical techniques- principal component analysis (PCA) and cluster analysis (CA). Water quality data collected during 4 seasons was analyzed for 13 parameters. ANOVA showed significant variation in pH (F3 = 10.86, P < 0.05), temperature (F3 = 65, P

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1971